Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1137820070280030355
ÀÇ°øÇÐȸÁö
2007 Volume.28 No. 3 p.355 ~ p.362
Prediction of Diabetic Nephropathy from Diabetes Dataset Using Feature Selection Methods and SVM Learning
Cho Baek-Hwan

Lee Jong-Shill
Chee Young-Joon
Kim Kwang-Won
Kim In-Young
Kim Sun-Il
Abstract
Diabetes mellitus can cause devastating complications, which often result in disability and death, and diabetic nephropathy is a leading cause of death in people with diabetes. In this study, we tried to predict the onset of diabetic nephropathy from an irregular and unbalanced diabetic dataset. We collected clinical data from 292 patients with type 2 diabetes and performed preprocessing to extract 184 features to resolve the irregularity of the dataset. We compared several feature selection methods, such as ReliefF and sensitivity analysis, to remove redundant features and improve the classification performance. We also compared learning methods with support vector machine, such as equal cost learning and cost-sensitive learning to tackle the unbalanced problem in the dataset. The best classifier with the 39 selected features gave 0.969 of the area under the curve by receiver operation characteristics analysis, which represents that our method can predict diabetic nephropathy with high generalization performance from an irregular and unbalanced dataset, and physicians can benefit from it for predicting diabetic nephropathy.
KEYWORD
diabetic nephropathy, feature selection, support vector machine, cost-sensitive learning
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI) ´ëÇÑÀÇÇÐȸ ȸ¿ø